Commercial suitability and characterization of newly developed Erucastrum canariense (Can) sterile cytoplasm based cytoplasmic male sterile (CMS) lines in Indian cauliflower

The study presents the first to characterize novel Erucastrum canarianse Webb and Berthel (or Can) sterile cytoplasm-based CMS lines in Indian cauliflower (Brassica oleracea var. botrytis L.) and investigating their commercial suitability. Eleven Can-based CMS lines were examined for 12 agro-morphological and yield traits,18 floral traits, four seed yield traits together with three each of the Ogura (source: wild Japanese Radish) and Tour (Source: Brassica tournefortii) cytoplasms. All of the recorded floral and seed traits showed significant (P > 0.05) differences between the CMS lines of each group. Agro-morphological and yield traits in CMS lines and their maintainers, however, were non-significantly different. All the Can- and Ogura-based CMS lines showed flowering and appropriate seed formation by natural cross-pollination. Only two Tour cytoplasm-based CMS lines, Tour (DC-41-5) and Tour (DC-67), produced the smallest malformed flowers and stigma. The highest seed yield per plant in CMS lines was in Ogu (DC-98-4) and the lowest in Tour (DC-67). P14 and P15, two polymorphic mtDNA markers, were discovered for the Can CMS system for early detection. Five primers (ITS5a-ITS4, atpF-atpH, P16, rbeL and trnL), along with their maintainers, were sequenced and aligned to detect nucleotide changes including as additions and or deletions at different positions. The newly introduced E. canariense sterile cytoplasm-based CMS system in cauliflower is the subject of the first comprehensive report, which emphasises their potential as a further stable and reliable genetic mechanism for hybrid breeding.

The CMS lines and their maintainers were observed for 12 agro-morphological and curd yield traits, 18 floral traits and four seed yield traits.Plant height (PH), plant spread (PS), leaf length (LL), leaf width (LW), number of leaves per plants (NLP), gross plant weight (GPW), marketable curd weight (MCW), net curd weight (NCW), root length (RL) and root width (RW) were recorded from five random plants in each plot at curd maturity or harvesting stage.Marketable curd yield (MCY, t/ha) was calculated from plot yield.The harvest index (HI) was obtained by using the flowing formula: HI = MCW (g)/GPW (g).
The 18 floral traits were recorded each of 10 random plants at the flowering stage in CMS lines and their maintainers grown in replicated trials with three replications.The observations were taken using 25 to 30 fully developed floral buds (for floral bud traits) or naturally opened flowers (for flower traits) from each plant.Flower length (FL), flower diameter (FD), stalk length (StaL), petal length (PL), petal width (PW), sepal length (SL), sepal width (SW), bud length (BL) and bud width (BW) were recorded using standard rural scale.Stigma length (StiL), long stamen length (LSL), short stamen length (SSL), long filament length (LFL), short filament length (SFL), anther length (AL), nectary length (NL) and nectary width (NW) were taken using Discovery.18 SteREO Zoom Microscope (Carl Zeiss Microimaging Gmbh, Germany) in Division of Floriculture and Landscaping, IARI, New Delhi.Fully opened flowers from already covered inflorescence branches were taken and nectar was collected in calibrated capillary tubes (Sigma-Aldrich, Missouri, USA).The nectar volume (NV, μl/10 flowers) was calculated by dividing the length of the nectar column in the capillary tubes by the total length of the capillary tube 42 .
For seed parameters, all the 17 CMS lines were grown alongwith fertile pollen-parent in 1:1 ratio in three replications at two isolation blocks namely Mela Gate Farm (MG) and Vegetable Research Farm (VRF) located ≈1000 m apart in ICAR-IARI, New Delhi.Pollen parent DC-67 and DC-98-4 were planted at MG and VRF site, respectively.The seed parameters namely siliqua length (SqL), number of seeds (NS) per siliqua, seed yield (SY) per plant and 1000-seed weight or test weight (TW) were recorded from five randomly tagged plants.

DNA extraction and PCR analysis
The DNA of CMS lines and their maintainers was extracted from fresh mature leaf samples from healthy young plants by using CTAB protocol 43 and stored at −80 °C for further molecular work.The sampling for DNA extraction was done from five plants of each genotype (CMS and maintainers).The DNA quality was checked using agarose gel and DNA concentration was measured using Nanodrop (Eppendorf).The DNA was diluted with 0.1 X TE buffer (10 mM Tris, 0.1 mM EDTA, pH 8.0) to yield a working concentration of 30 ng/μl.An equal volume of this working DNA sample from each of the five plants was mixed and made a representative sample for the respective genotype.
A set of 25 mt-DNA primers available in the public domain were synthesised by Integrated DNA Technologies, (IDT, Coraville, Iowa, USA) and screened for amplification using polymerase chain reaction (PCR) (Biometra Tone 96G, Analytik Jena, Germany).The details of the primers and their sequence information are given in Table S1.These were synthesized from Integrated DNA Technologies, Inc (IDT, USA).All the primers were amplified using 10 μl PCR mixture volume with 2X Master mix (GeneDireX® OnePCR™, Forward and Reverse Primer 1.0 μL (0.5 μL each, 25 μM), Template DNA (1.0 μL, 30 ng/μL) and Sterile double distilled water (3.0 μL, 75 ng/μL) using thermocycler.The PCR steps were: Initial denaturation at 94 °C for 5 min followed by 35 cycles of denaturation at 94 °C for 1 min, annealing at 55-65 °C for 1 min (varied with primer), extension at 72 °C for 1 min; final extension at 72 °C for 10 min and hold at 4 °C for 30 min followed by storing at 4 °C for further processing.

Electrophoresis and fragment detection
The amplified PCR products were resolved on 3% agarose gel in 100 mL 1X TAE buffer using 1 μg/mL of ethidium bromide florescent dye in gel electrophoresis (BioRad, USA).DNA bands were visualized and photographed under UV light in gel documentation unit (Alpha imager Cell Biosciences, Santa Clara, CA).The molecular markers showing variation in banding pattern among the genotypes were considered as polymorphic while rest were counted as monomorphic.Amplicon size was measured using the DNA ladder (GeneDireX Inc., 50 bp) as a reference.

Statistical analysis
The data from agro-morphological, floral and seed yielding traits were recorded and subjected "doebioresearch" package of R software for ANOVA (analysis of variance).Standard deviation and per cent change due to CMS introgression were calculated over the fertile (maintainer) line for agro-moprhological and floral traits using Microsoft Excel 2019.K-means clustering was implemented using "cluster" package of R software.The optimal number of clusters was determined by average silhouette width.Average silhouette width was computed employing "factoextra" package of R software.The "Hmisc" R package was utilized to perform correlation analysis on both agro-morphological and floral traits.Non-significant correlations, determined at a significance level of 5%, have been marked with a strikeout.Principal Component Analysis (PCA) has been employed using the "factominer" package of R Software.Factorial RBD design has been employed using the "doebioresearch" package of R software.

Statement regarding plant guidelines
The use of plant parts in the study complies with international, national, and/or institutional guidelines.

Analysis of variance (ANOVA) for traits
Table 2 provides the results of the analysis of variance for agro-morphological features observed in early cauliflower CMS lines and their maintainers.There were significant differences among the groups (CMS lines and

Variation in CMS lines and fertile maintainers for floral traits
The ANOVA for 18 floral traits in CMS lines and their maintainers is Table 4. Table 5 displays average values of 18 floral traits observed in CMS lines and their maintainers.The box plot of floral traits showed wide range (minimum, maximum), median, first quartile (Q1) and third quartile (Q3) in CMS lines (Fig. 1c) and their maintainers (Fig. 1d).Box plot analysis also revealed large variation in floral traits such as FL, FW, PL, PW, BW, StiL and NL in CMS lines.The cytoplasmic sterility reduced overall flower size and size of reproductive organs (stigma and stamens) in CMS lines (Fig. 2a).The flower parts (sepals, petals, stigma and anthers) of CMS lines and their maintainers of early cauliflower are presented in Fig. 2b.All the observed floral traits showed a significant difference in the CMS lines and maintainers.StiL in CMS lines was ranged from 0.65 to 1.03 cm which was significantly less than the maintainers (0.69-1.05 cm).Observations on length of stamens (i.e.LSL and SSL) and anther filament (LFL and SFL) showed significant differences in CMS lines and their maintainers.NL was maximum in Ogu(DC-67) (1.77 mm) and NW in Ogu (DC-41-5) (1.83 mm) while in maintainers, the observed maximum values were 2.06 mm (DC-63) and 1.95 mm (DC-8), respectively.NV also varied significantly in CMS lines (0.34-0.46 µL/10 flowers) and maintainers (0.48-0.56 µL/10 flowers).

Can cytoplasm-associated changes in agro-morphological and floral traits
In comparison to their maintainers, the introgression of nuclear genome of Indian cauliflower genotypes in sterile cytoplasms had significant impact on all 12 of the observed morphological traits (Fig. 3).PL changed in range of −16.1% in Can (DC-94-2) to 9.1% in Can (DC-98-4) and PS reduced the greatest in Can (DC-8) (−20.8%).LL and LW also influenced by sterile cytoplasms in the ranges of -26.2% to 6.1% and -17.7% to 11.2%, respectively.Except for Can (DC-23), all Can based CMS lines showed reduction in NLP.Impact of Can cytoplasm was noticed on GPW (-21.7 to 16.6%), MCW (-19.6 to 12.0%) and NCW (up to 13.4%).The changes in RL and RW ranged from -23.3% to 15.0% and -22.1% to 19.1%, respectively.The sterile cytoplasms also affected MCY, which was decreased in eight CMS lines while slightly increased in 3 CMS lines by 9.6 to 14.0%.Can (DC-98-4) experienced the greatest decline (19.5%).HI was also influenced in both the directions.

Impact of sterile cytoplasms on agro-morphological traits
Nine CMS lines developed through introgression of Tour, Ogura and Can cytoplasms in nuclear backgrounds of DC-98-4, DC-67 and DC-41-5 were observed for SqL, NS/siliqua, SY (g/plant) and TW (g).The sterile cytoplasms influenced seed traits and the greatest reduction was observed in Tour-based CMS lines (Fig. 5a-d).www.nature.com/scientificreports/While, the TW was found to be in the same range, SqL and NS/siliqua showed significant different.SY was significantly affected due to CMS introgression particularly due to Tour sterile cytoplasm.
Contribution of variables of agro-morphological traits in variation in CMS lines and maintainers is presented in (Fig. S2a, b). and their floral traits in (Fig. S2c, d).RL, GPW, HI and PS are important determinant show similarity in CMS lines while GPW, HI, NCW, MCY and MCW in maintainers .SL, FL, StaL, SW, NL, LFL, StiL, PL, FW and BL in CMS lines and SSL, LSL, LFL, SFL, FW, BW, NV, NL and StaL were contributors in variance in maintainers.
Bi-plot analysis for agro-morphological traits (Fig. 6a, b) and floral traits (Fig. 6c, d) in CMS lines and maintainers in early group of Indian cauliflower using "FactoMiner" R Package.This plot interpreted the similarity in most of the agro-morphological traits.HI, RL and GPW showed similarity in CMS lines whereas all the traits showed similarity in maintainers except GPW.For floral traits, LSL and SSL showed similarity in CMS lines whereas all the traits showed similarity in maintainers except SL.Cluster analysis of CMS lines and their maintainers for agro-morphological trait (Fig. 7a, b) and floral traits (Fig. 7c, d) could reveal three and two clusters, respectively.
Correlation analysis between agro-morphological traits in CMS lines and maintainers is presented in Fig. 8a, b and their floral traits in Fig. 8c, d.A significant positive correlation was observed among the traits such as NCW with MCW and MCY.RL positively correlated with HI and NL positively correlated with RW while GPW negatively correlated with PS and PS negatively correlated with LL in CMS lines.In maintainers MCW positively correlated with MCY and NCW positively correlated with MCW while PS negatively correlated with HI.In CMS lines, RL was strongly correlated with NLP, RW and HI.In floral traits, LFL has strong correlation with FW and NW.StiL significantly correlated with StaL and SL in CMS lines.In maintainers, LFL was strongly correlated with NV, SSL, SFL and FW (Fig. 8c, d).SSL significantly correlated with NV and SFL and BW with NL.These correlation studies are significant at 5% level of significance.

Mitochondrial specific DNA markers for CMS lines
The observations for amplification and level of polymorphism in CMS lines with 25 mtDNA markers are presented in Fig. S3.Two markers, namely P14, P15 and P16 were polymorphic in CMS and fertile maintainers.The marker P14 amplified fragments of 200 bp in CMS lines (lane 1-6, 8,9) and that of 250 bp in fertile maintainers (lane 15-20), whereas P15 amplified distinct bands of 200 bp in CMS lines (lane 3-8) and 250 bp in fertile maintainers (lane 13-18), respectively.P14 showed polymorphism between CMS lines carrying Ogura (lane 21, 22, 24) and canarianse (lane 1-6, 8, 9) CMS systems and their fertile counterparts.It also generated a unique band (175 bp) for Tour cytoplasm (lane 23), separating it from Ogura and canarianse cytoplasms.P15 could distinguish canarianse (lane 3-8) from Ogura (lane 21, 22, 24)) and Tour (lane 23) cytoplasms, however, it could not differentiate between the latter two systems.Notably, intra-CMS system variation was also observed among the Can cytoplasm carrying CMS lines such as Can (DC-209) (lane 7) and Can (DC-63) (lane 10) which amplified an amplicon of 250 bp while rest of the eight Can based CMS lines generated 200 bp with P14 primers.Similar observations were made from P15 which could generate three different bands in Can based CMS lines.P16 revealed a polymorphic band (450 bp) for Ogura (DC-45) (lane 24) while its maintainer DC-41-5 (lane 18) amplified 500 bp amplicon.

Sequence analysis
The results of alignment of amplicons produced by the eight genotypes employed in the study using ITS 5a-ITS 4 primer are shown in Fig. S3a.The aligned sequences were filtered from both the ends to remove the gaps and the central 614 residues were finally analyzed.In the aligned sequences, the sequences bearing similarity are shaded with the same colour e.g.A is shaded green, T red, C blue and G black.The unshaded residues are indicating that the residue in question is variable in the compared genotypes.There was not much variation in the analysed sequence up to 460 bases.In this portion, the sequences were observed to be same in all the genotypes except DC-23 and DC-98-4.It was observed that there was C in place of A at 95th position, G in place of T at 163rd position, C in place of T at 402nd, C in place of T at 416th position in DC-23 and DC-98-4.At the same time genotype Can (DC-41-5) showed presence of G at this position (416th).At position 432nd there was C in Can (DC-41-5) while G in rest of the genotypes.Similar analysis of the sequences revealed that only DC-23, DC-98-4 and Can (DC-41-5) showed different pattern as compared to the rest of the genotypes.The results of alignment of amplicons obtained using atpF-atpH primer from the eight genotypes (5 CMS lines and 3 fertile maintainers), two maintainers were common for four CMS lines (Fig. S3b).The central 429 residues were finally analysed.In the aligned sequences, the sequences bearing similarity are shaded with the same colour as in case of ITS 5a-ITS4.The lines showed variation at different positions for nucleotides.The sequence alignment of amplicons obtained using P16 primer from the eight genotypes (Fig. S3c) revealed no variation between Can (DC-23) and DC-23, while Can (DC-98-4) and Can (DC-41-5) showed variation due to addition and deletion of nucleotides at different positions.In case of rbeL primer (Fig. S3d).The central 663 residues were finally analyzed.In the aligned sequences no variation was detected between Can (DC-23) and DC-23 while Can (DC-98-4) and Can (DC-41-5) showed variation due to the addition and deletion of nucleotides at different positions.The sequence alignment of amplicons obtained using trnL primer from eight genotypes (Fig. S3e) could show addition and deletion of nucleotides in Can (DC41-5).It has additions at 14th, 15th, 60th, 74th, and 80th positions.Can (DC-41-5) had one deletion of nucleotides at 22nd, 32nd, 42nd, 48th, 50th and 57th positions.

Discussion
In terms of agro-morphological features, the CMS lines carrying Can and Tour sterile cytoplasms are on par with the CMS lines carrying Ogura sterile cytoplasm.However, there was a significant difference in floral traits between the cauliflower CMS lines carrying these three sterile cytoplasms.Tour cytoplasm-based CMS lines namely Tour (DC-41-5) and Tour (DC-67) showed a considerable reduction in floral traits.

Variations in agro-morphological, floral and seed traits
Cauliflower is a crop that was introduced to the Indian subcontinent, and the original genotypes had a temperate flowering habit meaning that they needed a protracted period of cold temperature for proper bolting and flowering for seed production.Newly evolved cauliflower had varied morphologies in terms of stalk length, leaf orientation, leaf size and colour, curd size and colour and plant frame 19 .'Indian cauliflower' or 'tropical cauliflower' was the name given to the evolved ecotype particularly to the early and mid-early groups.The cytoplasmic male sterility is widely used genetic mechanism for hybrid breeding in Brassica crops.It arises as a result of a genomic conflict between the mitochondrial and the nuclear genomes.The Ogura sterile cytoplasm was first deployed in European cauliflowers 35 and later on introgressed in Indian cauliflowers 26,38 .Still it is a robust and stable genetic system that ensures complete cross-pollination across the temperature range.The modification or alteration in the mitochondrial genome that causes male sterility may substantially impair CMS system performance.This www.nature.com/scientificreports/has been documented in the 'Polima' cytoplasm where partial restoration of fertility has occurred as a result of higher temperature 44 .As hybrid breeding in cauliflower currently solely depends on the Ogura CMS system, this will be a major calamity, if a Polima-like situation occurs.Thus, two additional sterile cytoplasms namely E. canariense (Can) and B. tournefortii Goun (Tour) 4,35 .This CMS introgression process might had an impact on other non-target but important traits such as agro-moprphological and floral traits.Because, normal size of flowers, properly developed petals, normal stigma and style and well-developed functional nectaries are required for drawing honey bees for pollination activities.The previous researchers have showed the negative effects of Ogura sterile cytoplasm on floral traits including flower size, petal length and width, stamens, stigma and nectary size as well as nectar volume 37,[45][46][47] .These defects/deformities included reduced flower size, petaloid anthers, carpeloid stamens, malformed nectary, twisted stigma and split anthers in Brassica crops.A reduction in floral traits in CMS lines of Indian cauliflower carrying Ogura, Can and Tour sterile cytoplasm was also reported 22,38 .
Size of functional nectaries is correlated with nectar volume 38,48 .And, the pollinator insects in Brassica crops are attracted to flowers based on the amount of nectar they produce [49][50][51] .
In current investigation, the pattern of the per cent change in plant morphology, curd yield and floral traits revealed varied results.The parameters of PH, PS and leaf attributes showed a variable pattern.Maximum reduction in PH was recorded in DC 98-4 (−16.1%), while overall gain was seen in Ogu (DC-98-1) and Can (DC-7) in Ogura and Can cytoplasm-based CMS lines, respectively.Three CMS lines namely Can (DC-903) (12.0%),Can (DC-209) (7.5%) and Can (DC-18) (3.5%), which were considerably greater than other two CMS systems, demonstrated good gains in MCW.Dey et al. 37 and Singh et al. 38 both made similar observations.The MCY from Can-based CMS lines (21.8 t/ha) was nearly equal to that of the CMS lines carrying Ogura (21.8 t/ha) and Tour (21.4 t/ha) cytoplasm.This might be explained by nuclear-cytoplasmic interactions and cooperative functions between particular nuclear backgrounds of cauliflower and alien cytoplasms (i.e.sterile cytoplasm from other species).These alloplasmic lines, which carry cytoplasms namely Can, refined Ogura and Tour from three different sources-E.canariense (complete cytoplasm), Raphanus sativus L. (only mitochondria) and B. tournifortti (complete cytoplasm), respectively.Refined Ogura based CMS lines carry chloroplast of B. oleracea, a unique functional combination in plant system.
The conversion procedure from fertile inbred lines to CMS lines had impact on floral features.Both FL, FD, PL and PW were decreased significantly.The findings are consistent with earlier studies in broccoli 46 , snowball cauliflower 37,48 and Indian cauliflower 22 .The conversion process significantly decreased the size of the nectaries (NL, NW).The findings align with earlier reports 22,48 .NV also affected due to CMS trait.This might be the www.nature.com/scientificreports/reason for a negative effect on seed yield in honey bee pollinated crops.The reduction in floral characteristics agreed with the findings of Dey et al. 41 In comparison to their fertile lines, they reported a considerable decline in floral characteristics in refined Ogura based CMS lines of snowball cauliflower.However, all the Can and refined Ogura carrying CMS lines had decent size of flowers for honey bee visits (based on field observations of the senior author).Among the Tour-based CMS lines, only Tour (DC98-4) had a proper flower while the remaining Tour (DC-67) and Tour (DC-41-5) had most malformed flowers with 'gripping phenomenon' as also noted by Singh et al. 22 The CMS phenotype is associated with abnormal recombination of mitochondria  genome and CMS genes (i.e.orf138 in Ogura CMS) encodes transmembrane proteins (ORF138) which bind to the mitochondrial membrane, affecting the hydrogen ion concentration gradient, thus affecting ATP synthesis.This leads to energy deficiency and excess accumulation of reactive oxygen species (ROS).The ROS play key role in cell signaling and their excess accumulation causes damage to lipids, proteins and DNA, inhibit enzyme activity, activate the programmed cell death (PCD) pathway and ultimately affect plant growth and development, particularly reproductive parts 52 .
The high seed yield in natural crossing conditions is essential for economic use of CMS system in hybrid breeding.The CMS trait impact most of floral traits which reflects in reduced seed yield also, however, in comparison, certain CMS systems have better performance than others 53 .The seed yield per plant in the refined Ogura-based CMS lines of snowball cauliflower namely Ogu1A, Ogu2A, and Ogu3A was reported to be 30.9g, 21.4 g and 26.8 g per plant, respectively 54 .They reported that the seed yield per plant for these CMS lines was 24.3%, 29.3% and 10.1% less than its maintainer, respectively.Kucera et al. 55 reported a reduction in seed yield per plant by 79.5% and 85.7% in FT CMS and BR CMS, respectively.It was attributed to fewer siliqua per plant and a smaller number of seeds per siliqua than their maintainers.The observed decrease in seed yield due to CMS trait in present study was consistent with the earlier findings 37,55 .However, this is the first in-depth account of changes brought on by Can sterile cytoplasm in agro-morphological and floral features of cauliflower.Similar to this, it was found that 11 Can-based CMS lines with the two pollen parents DC-67 and DC-98-4 produced hybrid seeds on average at rates of 19.5 g and 12.8 g/plant, respectively.Three Ogura based CMS lines with same pollen parents produced hybrid seeds 14.7 g and 18.5 g/plant, respectively.The Tour-based CMS lines, particularly Tour (DC-41-5) and Tour (DC 67) were worst performers for hybrid seed yield, mainly due to gripping phenomena of stigma 38 .High yield of hybrid seeds in present study could be due to genotype, open condition and nicking of CMS and pollen parents.Kucera et al. 55 obtained only 2.3 g/plant hybrid seeds from BR CMS line with pollen parent FT13 grown in isolation cages provided with bumble bees.Thus, the study demonstrates the potential of these novel Can sterile cytoplasm carrying CMS lines in production of hybrid seeds.
The PCA and cluster analysis identified key agro-morphological and floral traits which contribute major share of variance in the CMS lines.The potential for trait comprehension for breeding purposes is indicated by the strong association between root and leaf parameters, curd weight and yield characteristics, and stamen length and nectary traits.

mtDNA level molecular variation in CMS lines
At flowering stage, the CMS phenotype is persistent and simple to distinguish from fertile analogues.It is challenging to distinguish them at earlier stages, though.Further, the deployment of several sterile cytoplasms in cauliflower makes it challenging to distinguish these cytoplasms carrying CMS lines at a morphological level.Breeders will therefore benefit from the creation or discovery of DNA markers to differentiate between these several sterile cytoplasms.Additionally, because these markers are stage-independent, they can be applied to any specious mixture during the seedling stage or before flowering.Mitochondrial markers are thought to be a reliable method for identifying sterile and fertile cells as well as the type of cytoplasm that CMS lines carry 56,57 .
A dominant marker called ISSR3 was used by Wang and Song 58 to distinguish fertile counterparts from the CMS lines.According to them, it is possible to distinguish between fertile and sterile lines using an amplicon of 313 bp size using a primer pair P9/P10 for mtDNA marker specific to the CMS cauliflower knxd612.Only two of the 25 mtDNA markers namely P14 and P15 were polymorphic between CMS and male fertile phenotypes.further investigation to establish the association.Primer atpF-atpH amplicon sequence revealed unique bases at manifold position indicating its distinctness from the other three cytoplasms.Earlier reports also suggest variation among the Ogura-cytoplasm carrying CMS lines of cauliflower 25 and broccoli 31 .Deletions of nucleotide in the ORF coding region of broccoli cytolines was suggested to be associated with carpelloid stamen phenotype 31 , however, such abnormality was not reported in investigated CMS lines.

Conclusion
The present work provides the first in-depth information on the effect of this sterile cytoplasm in new nucleocytoplasmic combination (oleracea-canerianse) lines of Indian cauliflower.Erucastrum canarianse Webb and Berthel is a new CMS system deployed in cauliflower.These lines contain the cauliflower nuclear genome,

Figure 1 .
Figure 1.Box plot of 12 agro-morphological traits of Canarianse based CMS lines (a) and their maintainers (b) in early group of Indian cauliflower.Box plot of 18 floral traits of E. canarianse based CMS lines (c) and their maintainers (d) in early group of Indian cauliflower.
www.nature.com/scientificreports/ at two positions i.e. at 166th position A is deleted and T is deleted at 299th position.Five conversions were also observed at 178th, 258th, 318th, 370th and 429th positions.The sequence analysis revealed variation among the sterile cytoplasms and also in fertile inbreds for the observed sequences.The deletion was also observed in Can (DC-98-4) at 299th position but not in Can (DC-41-5) indicating change in sequences.At 323th position, the sequence alignment of atpF-atpH showed difference in the CMS lines namely Can (DC-41-5), Ogu (DC-41-5) and Tour (DC-41-5) and their maintainers DC-41-5.These CMS lines had T, while maintainer DC-41-5 had A.

Figure 2 .
Figure 2. (a) Flowers of fertile maintainers (top side) and CMS lines (bottom side) of cauliflower.(b) Flower floral parts (sepals, petals, stigma and anthers) of CMS lines and maintainers in early group of Indian cauliflower.

Figure 3 .
Figure 3. Per cent change in agro-morphological traits due to ingression of Can sterile cytoplasm in genotypes of early group of Indian cauliflower.

Figure 4 .
Figure 4. Per cent change in floral traits due to ingression of Can sterile cytoplasm in genotypes of early group of Indian cauliflower.

Figure 5 .
Figure 5. (a-d) Seed parameters from selected CMS lines and their maintainers in genotypes of early group of Indian cauliflower.

Figure 6 .
Figure 6.Principal component analysis (PCA) of agro-morphological traits (a, b) and floral traits (c, d) in CMS lines and maintainers in early group of Indian cauliflower using "FactoMiner" R Package.

Figure 7 .
Figure 7. (a-d) Cluster analysis of CMS lines and their maintainers for agro-morphological trait (a, b) and floral traits (c, d) in early group of Indian cauliflower.

Table 1 .
List of Can, Ogura and Tour CMS lines and their maintainers used for study on agro-morphological and molecular characterization of Indian cauliflower.

Table 2 .
Analysis of variance (ANOVA) for agro-morphological and curding traits observed from CMS lines carrying Can, Ogura and Tour cytoplasm's and their maintainers in Indian cauliflower.DF Degree of freedom.
*Significant at 5% level of significance when tested against MSS due to error.

Table 3 .
Agro-morphological and curding traits observed from CMS lines carrying Can, Ogura and Tour cytoplasm's and their maintainers in Indian cauliflower.CD critical difference, CV coefficient of variation.

Table 4 .
Analysis of variance (ANOVA) for floral traits observed from CMS lines carrying Can, Ogura and Tour cytoplasm's and their maintainers in Indian cauliflower.DF Degree of freedom.*Significant at 5% level of significance when tested against MSS due to error.

Table 5 .
Floral traits observed from CMS lines carrying Can, Ogura and Tour cytoplasms and their maintainers in Indian cauliflower.CD critical difference, CV coefficient of variation.

Table 6 .
Continuedin mtDNA sequences and spotting variation between CMS lines and their maintainers.Similarly, P16 was found to be useful to distinguish Ogu (DC-41-5) from its maintainer DC-41-5.In the instance of sequence analysis, the Can, Ogu and Tour cytoplasm harbouring CMS lines displayed variation.These novels Can CMS lines of Indian cauliflower creates the possibility for immediate use in hybrid breeding or further introgression in other members of B. oleracea group through backcrossing.Observations on F 1 hybrid seed produced from Can, Ogura and Tour cytoplasm carrying CMS lines in Indian cauliflower.CD critical difference, CV coefficient of variation.